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1 Fast detection of communication patterns in distribute d executi on s 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies oi 

Full text available: ^ pdf (4.21 MB) Additional Information: full citation, abstract , refere 

Understanding distributed applications is a tedious and difficult task. Visualizations based on proce 
of the execution of the application. The visualization tool we use is Poet, an event tracer developei 
very complex and do not provide the user with the desired overview of the application. In our exp< 
commun ... 

2 Illustrative risks to the public in the use o f com p uter systems and related technolo gy 
Peter G. Neumann 

January 1996 ACM SIGSOFT Software Engineering Notes, volume 21 issue 1 
Full text available: pdf(2.54 MB ) Additional Information: full cit ati o n 



Save results to a Binder 
^ Search Tips 

□ Open results in a new window 



3 RAID: h igh-performance, reliable secondary storage 

Peter M. Chen, Edward K. Lee, Garth A. Gibson, Randy H. Katz, David A. Patterson 
June 1994 ACM Computing Surveys (CSUR), Volume 26 issue 2 

Full text available: |p pdf(3.60 MB) Additional Information: full citation , abstract , refere 

Disk arrays were proposed in the 1980s as a way to use parallelism between multiple disks to imp 
lines of most major computer manufacturers. This article gives a comprehensive overview of disk . 
future work. First, the article introduces disk technology and reviews the driving forces that have [ 
the tw ... 

Keywords: RAID, disk array, parallel I/O, redundancy, storage, striping 



4 Highly available systems for database applications 
Won Kim 

March 1984 ACM Computing Surveys (CSUR), volume 16 issue 1 

Full text available: g pdf(2.43 MB) Additional Information: full citation , abstract , refere 

As users entrust more and more of their applications to computer systems, the need for systems t 
even greater. This paper presents a survey and analysis of representative architectures and techn 
systems for database applications. It then proposes a design of a distributed software subsystem I 
applica ... 

5 Session summaries from the 17th symposium on operating systems principle (SOSP'99) 
Jay Lepreau, Eric Eide 

April 2000 ACM SIGOPS Operating Systems Review, volume 34 issue 2 



Full text available: ^ j )pdf(3.15 MB) Additional Information: full citation , index terms 



6 DISP: Practical, efficient, secure and fault-tolerant distributed data storage 
Daniel Ellard, James Megquier 

December 2004 ACM Transactions on Storage (TOS), volume 1 issue 1 

Full text available: ^pdf(148.11 KB) Additional Information: full citation , abstract , refere 

DISP is a practical client-server protocol for the distributed storage of immutable data objects. Unl 
to make explicit tradeoffs between total storage space, computational overhead, and guarantees c 
Applications specify the degree of redundancy with which each item is encoded, what level of inte< 
items are stor ... 

Keywords: Distributed data storage 



7 BASE: Using abstraction to improve fault tolerance 
Miguel Castro, Rodrigo Rodrigues, Barbara Liskov 

August 2003 ACM Transactions on Computer Systems (TOCS), Volume 21 issue 3 

Full text available: pelf (438. 18 KB) Additional Information: full citation , abstract , refere 

Software errors are a major cause of outages and they are increasingly exploited in malicious atta 
some software errors but it is expensive to deploy. This paper describes a replication technique, B. 
tolerance and to improve its ability to mask software errors. BASE reduces cost because it enables 
availability ... 

Keywords: Byzantine fault tolerance, N-version programming, asynchronous systems, proactive i 



8 P ractical byzantine fault toleran ce and proactive recovery 
Miguel Castro, Barbara Liskov 

November 2002 ACM Transactions on Computer Systems (TOCS), volume 20 issue 4 

Full text available: ^pdf(1.63 MB) Additional Information: full citation , abstract , refere 

Our growing reliance on online services accessible on the Internet demands highly available systei 
bugs, operator mistakes, and malicious attacks are a major cause of service interruptions and the 
describes a new replication algorithm, BFT, that can be used to build highly available systems that 
implement re ... 

Keywords: Byzantine fault tolerance, asynchronous systems, proactive recovery, state machine r 



Reliability and security of RAID storage systems and D2D archives using SATA disk drives 
Gordon F. Hughes, Joseph F. Murray 

December 2004 ACM Transactions on Storage (TOS), volume 1 issue 1 

Full text available: |§ pdf(94.82 KB) Additional Information: full citation , abstract , refere 

Information storage reliability and security is addressed by using personal computer disk drives in 
cost of these serial ATA (SATA) PC drives is a tradeoff against drive reliability design and demonst 
Fibre Channel drives. This article discusses the tradeoff between SATA which has the advantage tt 
storage c ... 

Keywords: Disk drive, SATA, SMART, archival storage, failure prediction, secure erase, storage n 



10 Hive: fault containment for shared-memory multiprocessors 
J. Chapin, M. Rosenblum, S. Devine, T. Lahiri, D. Teodosiu, A. Gupta 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth ACI 

Full text available: ^_Qdf (1.90 MB) , Additional Information: full citation , references , citir 



11 Pilot: an operating system for a personal computer 

David D. Redell, Yogen K. Dalai, Thomas R. Horsley, Hugh C. Lauer, William C. Lynch, Paul R. McJom 
February 1980 Communications of the ACM, volume 23 issue 2 



Full text available: fgpd«1.14MB) 
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12 The HP AutoRAID hierarchical storage system 
John Wilkes, Richard Golding, Carl Staelin, Tim Sullivan 

February 1996 ACM Transactions on Computer Systems (TOCS), volume 14 issue l 

Full text available: Q pdf(1.82 MB) Additional Information: full citation , abstract , refere 

Configuring redundant disk arrays is a black art. To configure an array properly, a system adminis 
workload it will support. Incorrect understanding of either, or changes in the workload over time, < 
problem: a two-level storage hierarchy implemented inside a single disk-array controller. In the u| 
provide f ... 

Keywords: RAID, disk array, storage hierarchy 



1 3 Distributed operatin g systems 

Andrew S. Tanenbaum, Robbert Van Renesse 

December 1985 ACM Computing Surveys (CSUR), volume 17 issue 4 

Full text available: ^pdf( 5.49 MB) Additional Information: full citation , abstract, refere 

Distributed operating systems have many aspects in common with centralized ones, but they also 
distributed operating systems, and especially to current university research about them. After a di 
how it is distinguished from a computer network, various key design issues are discussed. Then se 
detail ... 

14 The HP AutoR A ID hierarchical stora g e system 
J. Wilkes, R. Golding, C. Staelin, T. Sullivan 

December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth ACI 

Full text available: ^ pdf( 1.60 MB) Additional Information: full citation , references, citir 



A Survey of Techniques for Synchronization and Recovery in Decentralized Computer Syste 
Walter H. Kohler 

June 1981 ACM Computing Surveys (CSUR), volume 13 issue 2 

Full text available: ^ pdf(3.33 MB) Additional Information: full citation , references , citings , index terms 



16 The TickerTAlP parallel RAID architecture 

Pei Cao, Swee Boon Lin, Shivakumar Venkataraman, John Wilkes 

August 1994 ACM Transactions on Computer Systems (TOCS), volume 12 issue 3 

Full text available: ^pdf(2.Q4 MB) Additional Information: full citation , abstract , refere 

Traditional disk arrays have a centralized architecture, with a single controller through which all re 
performance limits the maximum number of disks to which the array can scale. We describe Ticke 
controller functions across several loosely coupled processors. The result is better scalability, fault 

Keywords: RAID disk array, decentralized parity calculation, disk scheduling, distributed controlk 



Crosshatch disk array for improved reliability and performance 
S. w. Ng 



April 1994 ACM SIGARCH Computer Architecture News , Proceedings of the 21ST anr 

Volume 22 Issue 2 

Full text available: ^ pdf(955.28 KB) Additional Information: full citation , abstract , refere 

Redundant disk array architecture provides fault tolerance against disk drive failures. However, a ■ 
must also be controllers for interfacing with the disk drives, cabling for providing data/control path 
must also be able to tolerate failure in any one of these components. While currently known array 
the ... 

18 Experience Using Multiprocessor Systems — A Status Report 
Anita K. Jones, Peter Schwarz 

June 1980 ACM Computing Surveys (CSUR), volume 12 issue 2 

Full text available: ^| pdf(4.48 MB) Additional Information: full citation , references , citings , index terms 



19 Fault Tolerant Operating Systems 
Peter J. Denning 

December 1976 ACM Computing Surveys (CSUR), volume 8 issue 4 

Fuli text available: ^ pdf(2.69 MB) Additional Information: full citation , references , citings , index 



20 A taxonomy of computer program security flaws 

Carl E. Landwehr, Alan R. Bull, John P. McDermott, William S. Choi 
September 1994 ACM Computing Surveys (CSUR), volume 26 issue 3 

Full text available: ^| pdf(3.81 MB ) Additional Information: full citation , abstract , refere 

An organized record of actual flaws can be useful to computer system designers, programmers, ar 
for computer program security flaws, with an Appendix that documents 50 actual security flaws. T 
literature, but in widely separated places. For those new to the field of computer security, they pre 
how they ... 
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